SMIL: Multimodal Learning with Severely Missing Modality

نویسندگان

چکیده

A common assumption in multimodal learning is the completeness of training data, i.e., full modalities are available all examples. Although there exists research endeavor developing novel methods to tackle incompleteness testing e.g., partially missing examples, few them can handle incomplete modalities. The problem becomes even more challenging if considering case severely missing, ninety percent examples may have For first time literature, this paper formally studies with modality terms flexibility (missing training, testing, or both) and efficiency (most data modality). Technically, we propose a new method named SMIL that leverages Bayesian meta-learning uniformly achieving both objectives. To validate our idea, conduct series experiments on three popular benchmarks: MM-IMDb, CMU-MOSI, avMNIST. results prove state-of-the-art performance over existing generative baselines including autoencoders adversarial networks.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Latent Low-Rank Transfer Subspace Learning for Missing Modality Recognition

We consider an interesting problem in this paper that uses transfer learning in two directions to compensate missing knowledge from the target domain. Transfer learning tends to be exploited as a powerful tool that mitigates the discrepancy between different databases used for knowledge transfer. It can also be used for knowledge transfer between different modalities within one database. Howeve...

متن کامل

Towards SMIL as a foundation for multimodal, multimedia applications

Rich and interactive multimedia applications, where audio, video, graphics and text are precisely synchronized under timing constraints are becoming ubiquitous. Multimodal applications further extend the concept of user interaction combining different modalities, like speech recognition, speech synthesis and gestures. However, authoring dialog-capable multimodal, multimedia services is a very d...

متن کامل

Learning with Missing Features

We introduce new online and batch algorithms that are robust to data with missing features, a situation that arises in many practical applications. In the online setup, we allow for the comparison hypothesis to change as a function of the subset of features that is observed on any given round, extending the standard setting where the comparison hypothesis is fixed throughout. In the batch setup...

متن کامل

Modality Convergence in a Multimodal Dialogue System

When designing multimodal dialogue systems allowing speech as well as graphical operations, it is important to understand not only how people make use of the different modalities in their utterances, but also how the system might influence a user’s choice of modality by its own behavior. This paper describes an experiment in which subjects interacted with two versions of a simulated multimodal ...

متن کامل

Modality Theory: Supporting Multimodal Interface Design

Modality theory addresses the following general problem of mapping task domain information into interactive multimodal interfaces: given any particular set of information which needs to be exchanged between user and system during task performance in context, identify the input/output modalities which constitute an optimal solution to the representation and exchange of that information. This pap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i3.16330